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Abstract: This paper presents an approach for early breast cancer diagnostic by employing newer member of multiwavelet family: 
Balanced Multiwavelets. Detection and classification of microcalcification cluster is based on subband image decomposition. 
Detection of microcalcification is achieved by decomposing the mammogram and then reconstructing it from from the subbands 
containing only high frequency components. For this type of approach, we have applied different type of balanced multiwavelets for 
feature extraction. We used these results as an input to classification system. The proposed methodology is tested using 
Mammographic Image Analysis Society (MIAS) database. Results are presented as the receiver operating charactesistic (ROC) 


performance and quantified by the area under the ROC curve. 
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INTRODUCTION 

Breast cancer is the type of cancer with the highest 
incidence rates in women. It is also the most common 
cause of cancer death in women in many countries like 
Asian countries and United States [1]. The early 
detection of breast cancer is vital to improve its 
prognosis. Moreover, it is well known that screening 
mammography is the best tool available for detection 
of cancerous lesions before clinical symptoms appear. 
Clustered microcalcifications are one of the earliest 
signs of potential cancerous changes in breast tissue. A 
microcalcification is a small calcium deposit that has 
accumulated in breast tissue, and it appears as small 
bright spot on the mammogram. Benign calcifications 
are generally larger, more rounded, smaller in number, 
more diffusely distributed, and more homogeneous in 
shape and size. Malignant calcifications are typically 
very numerous, clustered, small, dot like or elongated, 
variable in size and shape. However, because of small 
size of microcalcifications, the characterization of 
benign and malignant lesions represents a very 
complex problem even for expert radiologist. 


In this work, we present a system for breast cancer 
diagnosis, and identification of microcalcification 
clusters in the digitized mammographic image and its 
classification as benign or malignant. To achieve the 
best results, we have employed different types of 
balanced multiwavelets such as BATO1, BATO2, 
CARDBAL2, and CARDBAL3. It is a well known fact 
that multi-scale representation has proven to be useful 
in many image processing applications and wavelet 
analysis is one way to generate such a representation. 
A newer alternative to wavelet analysis is the 
multiwavelet analysis which is very similar to wavelet 
analysis but has certain advantages those are proven to 
be more useful in image processing [2]. Because of the 
necessity of pre and post filtering of the image signal 
and its design constraints [3], researchers are giving 
attention to the balanced Multiwavelets. The concept 
of balancing was introduced in [4][5], extended to 
higher orders in [6][7], and further studied in [8]. In 
this paper, we have tried to elucidate the better 


performance of balanced multiwavelets in image 
processing applications such as feature extraction. We 
have attempted to use first, second, and third order 
balanced multiwavelets for feature extraction of 
mammographic images. This paper is organized as 
follows. Section II briefly presents the background 
survey of balanced multiwavelets. Methodology 
adopted and the algorithm implemented is covered in 
section III. Section IV focuses on results and related 
discussions. 


BALANCED MULTIWAVELETS 

It is known that multiwavelet filter bank is 
fundamentally a MIMO system. The problem of 
unbalanced channels used to split input signal is 
partially solved by adding some pre and post filters in 
the MIMO system. In pre-filtering, critical or non 
critical sampling is used which again destroys either 
the orthogonality or linear phase of the system or add 
redundancy. Hence, one can use _ orthogonal 
multiwavelet with good balance between the two 
scaling functions. 


Balancing: In orthonormal case, defining the band- 
Toeplitz matrix corresponding to the low pass analysis, 


L=| — M[0] Mf1] M[2] M[3]....... 
M{0] M[1] ....-. 


(1) 


where {M[k]}, is a sequence of r x r matrices of real 
coefficients. The condition imposed for balancing is 
that low pass synthesis operator L’ of the time varying 
filter bank based on this multiwavelet shall preserve 
the eigen signal [... 1, 1,1, 1, ...Jie. L[... 1, 1, 1, 
1 capa T= decors | Ee a: Cane ]'.. There are two 
ways to construct balanced multiwavelets. A simple 
way to construct balanced multiwavelets is to derive 
them from the complex Daubechies filters [9][10]. 
Another interesting way of constructing balanced 
multiwavelets is to balance already existing 
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unbalanced multiwavelets like the ones constructed in 
[11] [12]. Ultimately the aim of the balanced 
multiwavelet concept is to avoid intricate step of pre- 
filtering in multiwavelet based systems; that do not 
satisfy the interpolation / approximation properties of 
balancing. Implementation of balanced 
multiwavelets:; We have implemented balanced 
multiwavelets (BMW) of compact support, with 
flipped scaling functions and symmetric / anti- 
symmetric wavelets for order one and two. These 
BMWs (BATO1, BATO2) are shown in fig. 1. The 
importance of balancing is also discussed by Selesnick 
[8] that the smoothness of scaling functions generated 
by h(n) and those by h(n-1) is entirely different. It is 


fistseaingturetin = ggg son ating tet 
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shown that GHM scaling function generated by h(n) 
are continuous and almost differential where as those 
generated by h(n-1) are extremely non-smooth as 
shown in fig. 2. The same behavior is evident for other 
unbalanced multiwavelets. In the same paper [8], it is 
also shown that the minimal length K-balanced 
multiwavelet bases based on even length symmetric 
FIR filters are better behaved than those based on odd- 
length symmetric FIR filters. For order 2 balancing, 
the minimal length of ho and h, are 8 and 12, supported 
on n=(0,..., 7) and n=(0,..., 11), respectively. 
Similarly order 3 balanced multiwavelets are of 12 and 
16 minimal length. Fig. 3 show order 2 and 3 balanced 
multiwavelets. 
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Fig. 1 BATO1 (e), (f) Scaling functions (g), (h) Multiwavelet 
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Fig. 3 CARDBAL2 (a), (b) Scaling 
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Fig. 3 CARDBAL3 (e), (f) Scaling functions (g), (h) Multiwavelet 


METHODOLOGY ADOPTED 
Mammogram database is obtained from MIAS society 
developed by John Suckling [13] from which we have 


used 80 mammograms. Each image contains one or 
more clusters of microcalcifications verified by expert 
radiologists who have also marked the locations of 
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microcalcifications. Amongst 80 mammograms used 
for classification, 44 contain malignant lesions and 36 
contain benign lesions. 


Multiwavelet analysis is one way to generate multi- 
scale representation of the signal which is vital in 
image processing applications. Fig. 4 shows the final 
result of multiwavelet decomposition where first two 
columns and rows correspond to low pass filters (Ly, 
L,), and the next columns and rows correspond to high 
filters (H,, H2). Thus we decomposed each image to 16 
sub-images using each of the BMW. Fig. 5 shows 16 
sub-images of a typical cluster of microcalcification 
decomposed to first level using CARDBAL3 BMW. 
The method for feature extraction is based on 
decomposition of image and calculation of energy and 
entropy for image of each sub-band [14]. In an N * N 
sub-image, normalized energy and entropy are 
computed according to the following relations: 
L, L, H, HH, 


L,| LL; LL, L,H, L,H, 
L,| LL; Lil, L,H, Ly Hp 
H,} HL; H,L, H,H, HH, 
H,} Hol; H,L, | HoH, HH, 


Fig. 4 Image sub-bands after first level decomposition. 


Algorithm: 

Detection of microcalcification cluster 

A <¢— _— Read mammogram image 

[mn] <— size of the image 

For each pixel in Aj {i=1, ...,.m; j=l, ....n} 

Apply median filter with kernel of 3x3 size 

End 

C <+— = Multiwavelet transform (Aj) with BMW 
[pq] <+— sizeofC 

C[1:p/4, 1:q/4] <q — zeros (p/4, q/4); make low pass 
coefficients zero 

D <¢— Inverse multiwavelet transform (C,,) 

thl <«— threshold value for gray to binary 
conversion [16] 


For each pixelin Dj {i=1, ...,p; j=l, ....q} 


if (D, < thl) 
Dj = 0; 
else 
Dj = 1; 
end 


Dj © <— binary image showing microcalcification 
cluster 
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di LIX ij 
Energy = ------------ (2) 
N2 
Entropy = 
Yi Dj [x7 ij/ Norm’) log > [x’;;/ Norm] 
FO ergs Pee ye eee (3) 


log »N” 
where x ;; is the i j'" pixel value of the sub-image and, 
Norm?= vi Vix i 


We generated 16 sub-images for every cluster using 
one level multiwavelet decomposition. As a result, we 
get 32 features for each microcalcification cluster. 


Fig. 5 Result after first level decomposition of mammogram 
image using balanced multiwavelet 


Classification of microcalcification cluster 

Aj  <«— image after median filtering 

By;  <— gradient filtering (Ajj) 

C <— Multiwavelet transform (B,,) with BMW 
+ subimage of C (x=1, 2, 3, 4; y=1, 2, 3, 4) 
For each subimage D,, 

Energy,y4— Energy as per Eq. 2 

Entropyxy “— Entropy as per Eq. 3 


end 

Energyaye «— sum of Energy,, (x=1, 2, 3, 4; 
y=1, 2,3, 4)/ 16 

Entropyavg <«— sum of Entropy, (x=1, 2, 3, 4; 
y=1, 2,3, 4)/ 16 


thl <q threshold value for energy feature [16] 
th2 threshold value for entropy feature 
Classifier [16] 

{ 

input<q— thl, th2, Energy,,,., and Entropy, 
output «—Benign / Malignant cluster 


Fig. 6 psuecode of our system. 
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RESULTS AND DISCUSSIONS LiL; LiL, LH, LH, 
As mentioned earlier, we have used _ balanced (Ell) (12) (E13) (14) 
multiwavelets for extraction of features to be used by 
classification module. There are 32 features obtained LL ijl, | lai LH 
from 16 sub-images. (E21) (E22) (E23) (E24) 


Ail; | Hil, | HiM | HiH2 


Tables | to 4 show energy and entropy features where (E31) (632) (E33) (E34) 

E11 corresponds to the energy or entropy value of the 

sub-image in the first row and first column of image Hol) HoL» HH HH» 

shown in fig. 7 (E41) (E42) (E43) (E44) 
Fig. 7 Relation between decomposed sub-image and 
corresponding energy / entropy symbol. 


Table 1 Energy data obtained using CARDBAL3 balanced multiwavelet applied to images. 


E14 
E23 
E24 


E41 


Table 2 Entropy data obtained using CARDBAL3 balanced multiwavelet applied to images. 
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Table 3 Energy data obtained using CARDBAL3 balanced multiwavelet applied to images after gradient filtering. 


[ [amor [Mioots Twiauos2 [wanes wzser [a2 [am] 
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Table 4 Average of Energy data of table used for classification. 


11.45 136.53 79.01 272.06 243.97 280.04 261.62 


Mdb001 Mdb002 Mdb005-2 Mdb238 Mdb239-1 Mdb239-2 | Mdb241 
27.28 222.30 237.13 489.69 386.33 606.42 406.49 
Table 5 Classification of clusters on five point scale (CARDBAL3 balanced multiwavelet) 
Reference Radiologist’s interpretation 
standard Definitely Probably Possibly Probably Definitely Total 
results benign benign malignant malignant malignant 
Benign 10 11 8 6 1 36 
Malignant 2 3 6 20 13 44 
Total 12 14 14 26 14 80 


Table 6 Sensitivity, Specificity, and FPR for diagnosis and classification system of microcalcification clusters. 


Test positive if greater than or False Positive 


Sensitivity 


Specificity 


equal to Ratio (FPR) 
2: Probably benign 0.955 (42/44) 0.278 (10/36) 0.722 
3: Possibly malignant 0.886 (39/44) 0.583 (21/36) 0.417 
4: Probably malignant 0.75. (33/44) 0.806 (29/36) 0.194 
5: Definitely malignant 0.295 (13/44) 0.972 (35/36) 0.028 


Note:- These data are obtained from the results in table. FPR is 1 minus specificity. 
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Fig. 8 ROC curves obtained from system corresponding to balanced multiwavelets. 
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From Tables 1 to 4, it is evident that energy and 
entropy data can be used for classification of clusters. 
Microcalcification cluster images are obtained by 
selecting ROI from digitized mammogram images in 
which mdb001, mdb002, and mdb005-2 are benign 
lesions and mdb238, mdb239-1, mdb239-2, and 
mdb241 contain malignant lesions. Though energy and 
entropy data varies from image to image, the average 
of each image data is more suitable for classification, 
as seen from Table 4. 


ROC analysis: Sensitivity and specificity, are defined 
as the number of true positive decisions / the number 
of actually positive cases and the number of true 
negative decisions / the number of actual negative 
cases, respectively. Sensitivity and specificity 
constitute the basic measures of performance of 
diagnostic tests [15]. Amongst 80 mammograms, 44 
patients actually had malignancy and 36 patients had 
benign lesions; according to radiologist’s interpretation 
given with the database. Lesions were interpreted 
according to five point scale: | (definitely benign), 2 
(probably benign), 3 (possibly malignant), 4 (probably 
malignant), and 5 (definitely malignant). The result of 
this five point scale is shown in Table 5 and Table 6 
shows four pairs of sensitivity / specificity values for 
ROC curve. Area under ROC curve is a combined 
measure of sensitivity and specificity. The ROC curves 
of our classification system are shown in the fig. 8 in 
which red colored curve is_ obtained using 
CARDBAL3, blue colored curve from CARDBAL2, 
green colored from BATO2, and magenta colored from 
BATO1 balanced multiwavelets. The area under ROC 
curve is computed which indicates that the 
CARDBALS3 balanced multiwavelet has proven to be a 
better choice for classification of microcalcification 
clusters. 


Conclusion 

Intelligent system has been developed for the 
identification and classification of microcalcification 
clusters in digitized mammograms. This system is 
meant as a second opinion to assist radiologists for 
better interpretation of mammograms and to enhance 
the diagnosis reliability. The method employs balanced 
multiwavelets for experimentation on MIAS database 
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(c) 
Fig. 9 (a) Original Mammogram (b) Processed mammogram using BMW (c) Binary image 


of mammograms. As seen from the area under ROC 
curve, CARDBAL3 balanced multiwavelet with area 
ranging around 0.8398, outperforms the other BMWs 
with areas CARDBAL2 (0.8007), BATO2 (0.7757), 
and BATO1 (0.762). The system successfully combines 
intelligent methods and image processing techniques to 
enhance mammographic diagnosis sensitivity (Fig. 9). 
The proposed methodology could be a part of an 
integrated CAD system which could assist radiologists 
in mammogram analysis and diagnosis decision 
making. 
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